PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Neem_544_f_1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Sapindales; Meliaceae; Azadirachta
Family HD-ZIP
Protein Properties Length: 819aa    MW: 89250.7 Da    PI: 6.3569
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Neem_544_f_1genomeNGDView Nucleic Acid
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox65.47.9e-21109164156
                   TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
      Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                   +++ +++t++q++eLe+lF+++++p++++r eL+++l L++rqVk+WFqNrR+++k
  Neem_544_f_1 109 KKRYHRHTPQQIQELEALFKECPHPDEKQRLELSRRLCLETRQVKFWFQNRRTQMK 164
                   688999***********************************************999 PP

2START209.61.2e-653185421206
                   HHHHHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EEEEEEE CS
         START   1 elaeeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv.....dsgealrasgvvdmvlallveellddkeqWdetla....kaetlev 84 
                   ela++a++elvk+a+ +ep+W +s     e +n++e++++f++  +     + +ea+r+sg+v+ ++  lve+l+d + +W e+++    + +t++v
  Neem_544_f_1 318 ELALAAMDELVKMAQTDEPLWIRSLeggrEVLNREEYMRTFTPCIGmkptgFVTEASRESGMVIINSLALVETLMDPN-RWAEMFPcmiaRTSTTDV 413
                   5899**************************************99989*******************************.****************** PP

                   ECTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCEEEEEEEE-EE--S CS
         START  85 issg......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksnghskvtwvehvdlkg 174
                   is+g      galqlm aelq+lsplvp R++ f+R+++q+ + +w++vdvS+d  ++ ++ + +v +++lpSg+++++++ng+skvtwveh+++++
  Neem_544_f_1 414 ISNGmggtrnGALQLMHAELQVLSPLVPvREVNFLRFCKQHAESVWAVVDVSIDTIRETSSAPTFVNCRRLPSGCVVQDMPNGYSKVTWVEHAEYDE 510
                   **********************************************************999************************************ PP

                   SXXHHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
         START 175 rlphwllrslvksglaegaktwvatlqrqcek 206
                   +++h+l+++l++sg+ +ga++wvatlqrqce+
  Neem_544_f_1 511 SQVHQLYKPLISSGMGFGAQRWVATLQRQCEC 542
                   ******************************96 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF466891.2E-2092166IPR009057Homeodomain-like
Gene3DG3DSA:1.10.10.601.4E-2195166IPR009057Homeodomain-like
PROSITE profilePS5007117.216106166IPR001356Homeobox domain
SMARTSM003891.7E-17107170IPR001356Homeobox domain
CDDcd000862.02E-18108166No hitNo description
PfamPF000462.2E-18109164IPR001356Homeobox domain
PROSITE patternPS000270141164IPR017970Homeobox, conserved site
PROSITE profilePS5084844.353309545IPR002913START domain
SuperFamilySSF559612.7E-35311542No hitNo description
CDDcd088758.12E-129313541No hitNo description
PfamPF018521.9E-57318542IPR002913START domain
SMARTSM002341.9E-48318542IPR002913START domain
SuperFamilySSF559611.51E-21570812No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0009827Biological Processplant-type cell wall modification
GO:0042335Biological Processcuticle development
GO:0043481Biological Processanthocyanin accumulation in tissues in response to UV light
GO:0048765Biological Processroot hair cell differentiation
GO:0008289Molecular Functionlipid binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 819 aa     Download sequence    Send to blast
MSFGGFLDNS SGGGGARIVA DISYSNNMPT TTLAQPRLVT PSLTKSMFNS PGLSLALIIK  60
VTCLEWGENF ESVAGRRSRE EEHESRSGSE NMDGASGDDQ DAADNPPRKK RYHRHTPQQI  120
QELEALFKEC PHPDEKQRLE LSRRLCLETR QVKFWFQNRR TQMKTQLERH ENSLLRQEND  180
KLRAENMSIR DAMRNPICTN CGGPAIIGDI SLEEQHLRIE NARLKDELDR VCALAGKFLG  240
RPISSLATSM APPMPNSSLE LGVGSNGFGG LSPVTTTLPL GPDFGGGISN PLPVMPPNRS  300
TAGVTGLDRS IERSMFLELA LAAMDELVKM AQTDEPLWIR SLEGGREVLN REEYMRTFTP  360
CIGMKPTGFV TEASRESGMV IINSLALVET LMDPNRWAEM FPCMIARTST TDVISNGMGG  420
TRNGALQLMH AELQVLSPLV PVREVNFLRF CKQHAESVWA VVDVSIDTIR ETSSAPTFVN  480
CRRLPSGCVV QDMPNGYSKV TWVEHAEYDE SQVHQLYKPL ISSGMGFGAQ RWVATLQRQC  540
ECLAILMSST VSARDHTAIT AGGRRSMLKL AQRMTDNFCA GVCASTVHKW NKLNAGNVDE  600
DVRVMTRKSV DDPGEPPGIV LSAATSVWLP VSPQRLFDFL RDERLRSEWD ILSNGGPMQE  660
MAHIAKGQDH GNCVSLLRAS FDSRNAVKAM NANQSSMLIL QETCIDAAGS LVVYAPVDIP  720
AMHVVMNGGD SAYVALLPSG FAIVPDGPGS RGPMANGPTS GNGGNGGSPR VSGSLLTVAF  780
QILVNSLPTA KLTVESVETV NNLISCTVQK IKAALQCES
Binding Motif ? help Back to Top
Motif ID Method Source Motif file
MP00421DAPTransfer from AT4G00730Download
Motif logo
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankAM4593071e-39AM459307.2 Vitis vinifera contig VV78X101111.6, whole genome shotgun sequence.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_010661562.10.0PREDICTED: homeobox-leucine zipper protein ANTHOCYANINLESS 2 isoform X2
SwissprotQ0WV120.0ANL2_ARATH; Homeobox-leucine zipper protein ANTHOCYANINLESS 2
TrEMBLA0A061DTK70.0A0A061DTK7_THECC; HD domain class transcription factor isoform 2
STRINGVIT_15s0048g02000.t010.0(Vitis vinifera)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM112827105
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G00730.10.0HD-ZIP family protein